Self-organizing map for cluster analysis of a breast cancer database

نویسندگان

  • Mia K. Markey
  • Joseph Y. Lo
  • Georgia D. Tourassi
  • Carey E. Floyd
چکیده

The purpose of this study was to identify and characterize clusters in a heterogeneous breast cancer computer-aided diagnosis database. Identification of subgroups within the database could help elucidate clinical trends and facilitate future model building. A self-organizing map (SOM) was used to identify clusters in a large (2258 cases), heterogeneous computer-aided diagnosis database based on mammographic findings (BI-RADS) and patient age. The resulting clusters were then characterized by their prototypes determined using a constraint satisfaction neural network (CSNN). The clusters showed logical separation of clinical subtypes such as architectural distortions, masses, and calcifications. Moreover, the broad categories of masses and calcifications were stratified into several clusters (seven for masses and three for calcifications). The percent of the cases that were malignant was notably different among the clusters (ranging from 6 to 83%). A feed-forward back-propagation artificial neural network (BP-ANN) was used to identify likely benign lesions that may be candidates for follow up rather than biopsy. The performance of the BP-ANN varied considerably across the clusters identified by the SOM. In particular, a cluster (#6) of mass cases (6% malignant) was identified that accounted for 79% of the recommendations for follow up that would have been made by the BP-ANN. A classification rule based on the profile of cluster #6 performed comparably to the BP-ANN, providing approximately 25% specificity at 98% sensitivity. This performance was demonstrated to generalize to a large (2177) set of cases held-out for model validation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Modfied Self-organizing Map Neural Network to Recognize Multi-font Printed Persian Numerals (RESEARCH NOTE)

This paper proposes a new method to distinguish the printed digits, regardless of font and size, using neural networks.Unlike our proposed method, existing neural network based techniques are only able to recognize the trained fonts. These methods need a large database containing digits in various fonts. New fonts are often introduced to the public, which may not be truly recognized by the Opti...

متن کامل

Semantic Correspondence of Database Schema from Heterogeneous Databases using Self-Organizing Map

This paper provides a framework for semantic correspondence of heterogeneous databases using selforganizing map. It solves the problem of overlapping between different databases due to their different schemas. Clustering technique using self-organizing maps (SOM) is tested and evaluated to assess its performance when using different kinds of data. Preprocessing of database is performed prior to...

متن کامل

Breast Cancer Analysis using Independent Component Analysis (ICA) and Self Organizing Map (SOM)

A method for discrimination and classification of breast cancer dataset with benign and malignant tissues is proposed using Independent Component Analysis (ICA) and Self Organizing Map (SOM). The method implement ICA for preprocessing and data reduction and SOM for data analysis. The best performance was obtained with ICASOM, resulting in 98.8% classification accuracy and a SOM result is 94.9%.

متن کامل

Lifestyle patterns in the Iranian population: Self- organizing map application

Background: The present study evaluated the lifestyle behavior patterns and its associations with demographic factors in the Iranian population. Methods: A total of 8244 people aged 25-70 years who participated in a national survey in 2011 were included in the study. Factors related to lifestyle (such as diet, physical activity, and tobacco use) have been collected using a questionnaire. A sel...

متن کامل

Application of a Self-Organizing Map for Clustering the Groundwater Quality in Kerman Province and Assessment its Suitability for Drinking and Irrigation Purposes

Evaluation of groundwater hydro chemical characteristics is necessary for planning and water resources management in terms of quality. In the present study, a self-organizing map (SOM) clustering technique was used to recognize the homogeneous clusters of hydro chemical parameters in water resources (including well, spring and qanat) of Kerman province; then, the quality classification of groun...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Artificial intelligence in medicine

دوره 27 2  شماره 

صفحات  -

تاریخ انتشار 2003